Network Harness: Metropolis Public Transport 
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We analyze the public transport networks (PTNs) of a number of major cities of the world. While 
the primary network topology is defined by a set of routes each servicing an ordered series of given 
stations, a number of different neighborhood relations may be defined both for the routes and the 
stations. The networks defined in this way display distinguishing properties, the most striking being 
that often several routes proceed in parallel for a sequence of stations. Other networks with real- 
world links like cables or neurons embedded in two or three dimensions often show the same feature 
- we use the car engineering term harness for such networks. Geographical data for the routes reveal 
surprising self-avoiding walk (SAW) properties. We propose and simulate an evolutionary model of 
PTNs based on effectively interacting SAWs that reproduces the key features. 

PACS numbers: 89.75.Hc, 89.75.Da, 89.40.Bb 



INTRODUCTION 

Taken the general interest in networks of man-made 
and natural systems , it is remarkable that one of the 
most commonly encountered networks, the urban public 
transport network (PTN), is much less studied. The PTN 
constitutes an example of a transportation network and 
shares general features of these systems: evolutionary 
growth, optimization, embedding into two-dimensional 
(2D) space. However, as compared with other trans- 
portation networks like airport networks ^> HI > railway 
networks , or power grid networks 0, E | much less is 
known about the statistical and topological properties of 
PTNs. 

The few studies that have considered these questions 
so far have either been performed on a subnetworkftram, 
subway) of a specific city like Boston or Vienna [y, 0> 
or they were limited by the number of cities or their 
size. The recent thorough study of the bus and tram 
networks of 22 Polish cities considered PTNs with up to 
2811 stations Our preliminary work on some larger 
networks was restricted to only three cities (Berlin, 
Diisseldorf, and Paris). The present picture that emerges 
from this research views PTNs as networks with small- 
world properties and hierarchical organization as derived 
from various network characteristics and their correla- 
tions [9j. While indications for scale-free behavior have 
been found these depend strongly on the interpretation 
of the network. 

The present article is based on a survey of selected ma- 
jor cities of the world with PTN sizes ranging between 
2000 and 46000 stations. Simulations of an evolutionary 
growth model based on self-avoiding walks that we pro- 
pose appear to reproduce most of the key features of these 
PTNs. In addition to the standard characteristics of com- 
plex networks like the number of nearest neighbors, mean 



path length, and clustering we observe features specific 
to PTNs due to their embedding in geographic space con- 
strained by city structure. The most striking being that 
often several routes proceed in parallel on a common road 
or track for a sequence of stations. While other networks 
with real-world links like cables or neurons embedded 
in two or three dimensions often show similar behavior, 
these can be studied in detail in our present case. 




FIG. 1: PTN graph representations. L -space: filled circles 
represent stations serviced by two different routes shown by 
a bold and a thin line. L-space: reduction of L to a simple 
graph with single links. P-space: any two stations that are 
serviced by a common route are linked. B-space: stations are 
linked to the routes (open circles) they belong to. C-space: 
routes are connected when they share a common station. 

From maps of PTNs it is obvious that routes in gen- 
eral do not follow the shortest path between their termi- 
nal stations but rather meander through neighborhoods 
and between sites of interest. We quantify this behavior 
by analyzing geographical data. This leads us to model 
the routes as self-avoiding walks that efficiently cover the 
surface while the sites of interest introduce an effective 
attraction between them. 

To work out the specific features of PTNs, it is natu- 
ral to interpret the public transport system of a city in 
terms of graphs, see Fig. ^ The primary network rep- 
resentation, which we call L -space is defined by a set of 
routes each servicing an ordered series of stations. For 
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the PTNs studied so far, two different neighborhood re- 
lations were used that lead to different representations. 
In the first one, called L-space, two stations are defined 
as neighbors only if one station is the successor of the 
other one in the series of stations serviced by a route • 
In the second one, the P-space, two stations are neigh- 
bors whenever they are serviced by a common route |4|. 
Some of the specific features found in PTNs are more 
naturally described in additional representations that we 
introduce. A bipartite graph, B-space, is constructed by 



representing both routes and stations as nodes of differ- 
ent type as depicted by filled and open circles in Fig. 
H The one-mode projection of the B-space graph to the 
station nodes leads back to the P-space representation, 
whereas the corresponding projection to the set of routes 
results in a complementary C-space graph. Note that L 
differs from L only by the presence of multiple links. In a 
similar fashion one may define additional primed spaces 
P , C by keeping multiple links also in P and C 



City 


N 


M 


K 




7 


k 


k/k 


7p 


kp 




C 


C 


C p 


c P 


£ 


I 




e P 


Berlin 


2997 


218 


3.16 


80.2 


(4.30) 


1.24 


0.49 


(5.86) 


38.5 


0.70 


0.08 


91.58 


0.81 


43.22 


88 


21.60 


6 


3.10 


Dallas 


7163 


131 


2.22 


136.3 


4.99 


(1.01) 


0.49 


(4.67) 


76.9 


0.77 


0.01 


37.87 


0.97 


63.34 


269 


85.80 


10 


3.78 


Diisseldorf 


1615 


124 


3.20 


90.3 


(3.99) 


1.12 


0.44 


(4.63) 


58.8 


1.02 


0.04 


22.91 


0.79 


20.99 


56 


13.18 


5 


2.58 


Hamburg 


8159 


708 


3.25 


78.4 


(4.70) 


1.47 


0.56 


(4.92) 


55.6 


1.11 


0.08 


255.59 


0.82 


133.99 


158 


39.71 


11 


4.78 


Hong Kong 


2117 


321 


5.22 


230.1 


(3.04) 


2.60 


0.72 


(4.40) 


125.0 


1.01 


0.16 


92.33 


0.73 


12.51 


60 


11.11 


4 


2.26 


Istanbul 


4043 


414 


2.69 


140.1 


4.04 


(1.13) 


0.49 


(2.70) 


71.4 


0.93 


0.03 


44.99 


0.79 


41.54 


131 


29.69 


6 


3.09 


London 


11026 


2005 


3.23 


213.9 


4.58 


(1.46) 


0.48 


4.39 


(142.9) 


1.27 


0.16 


658.15 


0.70 


78.51 


77 


22.03 


6 


3.19 


Los Angeles 


46244 


1893 


2.73 


154.4 


4.88 


(1.50) 


0.64 


3.92 


(200.0) 


2.07 


0.03 


599.68 


0.90 


430.68 


247 


43.55 


14 


4.60 


Moscow 


3756 


679 


7.94 


129.7 


(3.31) 


2.12 


0.65 


(2.91) 


50.0 


0.78 


0.11 


129.78 


0.74 


43.14 


28 


7.07 


5 


2.52 


Paris 


4048 


232 


6.41 


79.5 


2.61 


(3.24) 


0.94 


3.70 


(100.0) 


2.07 


0.07 


86.78 


0.88 


72.88 


47 


7.22 


5 


2.79 


Rome 


6315 


681 


3.02 


86.4 


4.39 


(1.16) 


0.45 


(5.87) 


45.5 


0.76 


0.03 


69.19 


0.73 


76.93 


93 


29.64 


8 


3.58 


Sao Paolo 


7223 


998 


5.95 


333.6 


2.72 


(4.20) 


1.29 


(3.06) 


200.0 


1.46 


0.23 


514.11 


0.73 


38.32 


33 


10.34 


5 


2.66 


Sydney 


2034 


596 


4.35 


73.2 


3.99 


(1.82) 


0.55 


(5.66) 


38.5 


0.91 


0.14 


87.66 


0.73 


34.92 


35 


12.71 


7 


3.03 


Taipei 


5311 


389 


4.02 


415.5 


(3.74) 


1.75 


0.56 


(5.17) 


200.0 


0.85 


0.11 


188.89 


0.69 


15.38 


74 


20.86 


6 


2.35 


a=0,6=0.1 


635 


500 


2.77 


216.6 








4.43 


(76.9) 


0.47 






0.76 


2.9 


110 


36.10 


4 


1.96 


a=0,b=0.5 


3336 


500 


3.14 


302.7 








(7.66) 


111.1 


0.62 






0.68 


12.6 


190 


49.21 


7 


3.00 


a=0,fe=8 


5464 


500 


3.36 


233.8 








(12.35) 


71.4 


0.46 






0.65 


22.8 


229 


59.37 


9 


3.71 



TABLE I: PTN characteristics in L- and P-representations. Index 'p' indicates P-space characteristics. The last three rows 
show data for simulated cities. We list the number of stations N, the number of routes M, the ratio k of the second k 2 to 
the first moment k of p(k), the exponent 7, and the scale k of fits of p(k) to power and exponential laws and the ratio k/k, 
the average clustering coefficient C and its ratio C to the ER- value Cer = k/N, the maximal £ and the mean £ shortest path 
lengths; e.g.. an average trip between the 11026 London stations needs £ p —3.2 changes with a maximum of £ P =6; see text. 



For our empirical survey we acquired publicly avail- 
able data of the PTNs of 14 major cities from the web- 
pages of local public transport organizations 01 ■ Table 
HI lists cities together with some of the characteristics ex- 
tracted from our analysis. A more complete account will 
be given in a separate publication . To check for the 
small- world properties of the networks we have analyzed 
the mean and maximal shortest path lengths £, I. The 
data for space L given in Table [I] show that these num- 
bers are very small as compared to the number of nodes. 
In the P-space the shortest path lengths £ p , £ p corre- 
spond to the number of changes one should make trav- 
eling between two given stations. From these data we 
conclude that a typical station is within relatively short 
reach from all other stations. As has been shown 113. 



small world networks are also highly clustered as charac- 
terized by the clustering coefficient. The latter is defined 
by Cj = 2yi/ki(ki — 1) where ki is the degree of node 
i and yi is the number of links between its neighbors. 
The large ratios C = C/Cbr of the average values with 
respect to those of Erdos-Renyi (ER) random graphs of 
the same sizes confirm the high clustering that is present 
in these networks. 

A very fruitful concept that has lead to a unifying view 
on complex networks and also leads to their classification 
is that of scale-free behavior 0|. If p(k), the probability 
that given node of the network has degree k, follows a 
power law p(k) ~ fc~ 7 , indicating a fat tail of the node 
degree distribution, the network is said to be scale-free. 
While many networks and in particular random graphs 
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have an exponentially or even faster decaying distribu- 
tion p(k), it has been shown that an evolutionary growth 
procedure with preferential attachment to high degree 
nodes leads to scale- free behavior [l4|. Furthermore, for 
many applications scale-free and small world networks 
emerge naturally upon optimization for minimizing both 
the costs for communication and maintenance [l5| which 
are central criteria for the design of PTNs. 

Using scaling arguments for tree graphs the depen- 
dence of a number of properties of scale-free networks 
on the exponent 7 has been worked out. In particular, 
the value of 7 discriminates between different classes for 
the percolation behavior: if 7 > 4, the behavior is equiv- 
alent to that of networks with exponential p(k), while 
it is qualitatively different for 7 < 4, moreover there is 
no percolation threshold for 2 < 7 < 3 16]. The per- 
colation threshold is given by: k = k 2 /k — 2. For the 
PTNs studied this ratio is listed in Table [I] Finding a 
network with k near the threshold means that it is vul- 
nerable against failure in the sense that inactivating a 
small number of nodes may break the network into dis- 
connected clusters. The Table shows that all PTNs are 
clearly above this threshold, for some of them k indicates 
an especially strong resilience against failure [llj . 



9] . Fig. [5] however clearly indicates that the P-space 
node degree distribution of the Los Angeles PTN follows 
a power-law with exponent 7 P = 3.9. The same also ap- 
plies to London and Paris, all other cities appear to have 
an exponentially decaying p(k) in this space. Whereas 
an exponential distribution may be explained by a ran- 
dom placement of the routes, the power law found for 
the three cities indicates that the routes are organized 
in a correlated way. The deeper reasons for this special 
behavior currently remain unclear. 

Using the full information about the route paths as in- 
cluded in the L -representation with colored links (Fig. 

we can extract specific PTN features. One of them is 
that several routes may proceed in parallel on a common 
road or track for a sequence of stations. The emerg- 
ing picture very much resembles networks found in car 
wiring technology, where the term harness is used. To 
quantitatively describe this characteristic we introduce 
the harness distribution P{r, s): the number of sequences 
of s consecutive stations that are serviced by r parallel 
routes. In Fig. we show the harness distribution for 
the PTN of Sydney. The log-log plot indicates that also 
this distribution appears to be scale-free. Similar behav- 
ior was found for most of the cities included in our study. 




1.5 2 
Log 10 (k) 

FIG. 2: Integrated node degree distribution P(k) — 
J k max p(k)dk for the Los Angeles PTN in P-space with fit 
to a power law. 

More information can be gained by fitting the degree 
distribution to a power-law as above or to an exponen- 
tial function p{k) ~ exp(— k/k) observing the quality of 
the fit. In Table Q] we list the results of both fits for 
the L and P-spaces. In the L-space representation, the 
p{k) of eight cities is found to be well fitted by a power 
law with 7 values between 2.6 and 5. The corresponding 
results for the exponential fits for these cities are given 
in parentheses. The other six cities are rather governed 
by an exponentially decaying p{k) (for these, the power 
law fit is shown in parentheses). Whereas the scale- free 
behavior in the L-space has also been seen in previous 
work [jj, 0] , such behavior was ruled out for the P-space 
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FIG. 3: Harness distribution: number of sequences of s con- 
secutive stations that are serviced by r parallel routes, a: 
Sydney PTN , b: simulated city with a — 0, b — 0.5. 

The evidence for the scale-free properties of PTNs pre- 
sented so far encourages us to propose an evolutionary 
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growth model for these networks along the following lines: 
We model the grid of streets by a quadratic 2D lattice and 
allow every lattice site r (street corner) to be a potential 
station visited by e.g. k? routes. The routes are mod- 
eled as self-avoiding walks (SAWs) on this lattice. This 
model captures the typical meandering of the routes to 
optimize the the coverage of neighborhoods. Obviously, 
real routes are also planned to access sites of interest. 
These serve as points of attraction of the walks and 'in- 
tegrating out' their specific locations one is left with an 
effective attraction between the routes. 
The rules of our model are the following: 

1. First route: construct a SAW of length n. 

2. Subsequent routes: (i) choose a terminal station 
at the lattice site r with probability q ~ k? + a; 

(ii) choose a subsequent station of this route at a 
neighboring site site f with probability q ~ kf + b; 

(iii) repeat step (ii) until the walk has reached n 
stations, in case of self-intersection discard the walk 
and restart with step (i). 

3. Repeat step 2 until m routes are created. 

Implementing the model on a 2D lattice implies the as- 
sumption of a PTN growing in a regular isotropic un- 
corrected environment. While it is natural that the 
routes should not intersect themselves, the hypothesis 
that apart from the effective attraction they otherwise 
proceed randomly may not be obvious. SAWs are well 
studied in physics to model scaling properties of poly- 
mers and polymer networks 17]. In 2D, the end-to-end 
distance R of a SAW of N steps is known to scale for 
large N as R ~ N v with an exact result for the expo- 
nent !/ = 3/4 0. This result as well as other scaling 
properties remain unperturbed even on a weakly disor- 
dered lattice as long as the disorder is not long-range 
correlated 0] . This supports our choice of disregarding 
such possible disturbances of the lattice in our model. 
We have tested this hypothesis using publicly available 
geographical data [13] for the stations of the Berlin PTN. 
Plotting the root mean square distance R as function of 
the number of stations traveled starting from a terminal 
we find surprising agreement with the 2D SAW behavior 
(see Fig. 0J. Furthermore, the corresponding result for 
a simulated city confirms this picture. Apparently, the 
SAW balances between area coverage and traveling time. 

In Table ID we have included the characteristics as ex- 
tracted from simulated PTNs with m = 500 routes each 
of n = 50 stations for typical parameters a and b. In 
particular our model is nicely suited to reproduce the 
harnessing effect of the PTNs as shown in Fig. |3p. Vary- 
ing the parameter b for a = we observe a crossover from 
scale-free behavior in P-space for small b to an exponen- 
tial one as b increases beyond 1. Some L-space charac- 
teristics we left blank in the table due to artifacts of the 
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FIG. 4: Berlin PTN: R as function of the number of stations 
traveled compared with the 2D SAW and a simulated city 
with parameters a — 0, b — 0.5. 

square lattice neighborhood. Although our growth rules 
look very similar to usual network evolution by preferen- 
tial attachment |l4j . there is no simple relation between 
the parameters a and b and the exponent 7 even in the 
scale-free scenario. The principal difference of our algo- 
rithm is that at each step we link an existing station to a 
neighboring site which does not need to be empty. New 
stations are then only added at the frontier of the PTN 
cluster while high degree nodes (hubs) accumulate at its 
center. 

Our analysis of PTNs of so far unexplored sizes brings 
about that very large PTNs may display scale free P- 
space distributions and confirms corresponding L-space 
results 0,0] . The surprising SAW behavior of the routes 
encouraged us to analyze an evolutionary model of mutu- 
ally attractive SAWs which reproduces a number of key 
features of PTNs. In conclusion, we want to emphasize 
the importance of the constraints that are imposed on 
the network by the area consuming links when it is em- 
bedded in a 2D space. The harnessing effect presented 
in this study being a first example. Similar problems are 
met e.g. for electric circuit design. 
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